SF-Sketch: A Two-Stage Sketch for Data Streams
Related resources
SF-sketch: slim-fat-sketch with GPU assistance
A sketch is a probabilistic data structure used to record the frequencies of items in a multiset. Various types of sketches have been proposed in the literature and applied in a variety of fields, such as data stream processing, natural language processing, and distributed data sets. While several variants of sketches have been proposed in the past, existing sketches still have a significant r...
Pyramid Sketch: a Sketch Framework for Frequency Estimation of Data Streams
A sketch is a probabilistic data structure used to store and query the frequency of any item in a given multiset. Because of its high memory efficiency, it has been applied in various fields of computer science, such as stream databases and network traffic measurement. The key metrics of sketches for data streams are accuracy, speed, and memory usage. Various sketches have been proposed, but...
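To make the idea of storing and querying item frequencies concrete, here is a minimal illustration of a generic Count-Min sketch. It is not the SF-sketch or Pyramid sketch design described in these papers, only a common baseline under stated assumptions: the class name `CountMinSketch`, the parameters `width` and `depth`, and the use of salted `blake2b` hashing are all choices made for this example.

```python
import hashlib


class CountMinSketch:
    """Generic Count-Min sketch (illustrative baseline, not SF-sketch)."""

    def __init__(self, width=1024, depth=4):
        # depth rows of width counters; one hash function per row.
        self.width = width
        self.depth = depth
        self.rows = [[0] * width for _ in range(depth)]

    def _index(self, item, row):
        # Derive a per-row hash by salting blake2b with the row number.
        digest = hashlib.blake2b(
            item.encode("utf-8"),
            digest_size=8,
            salt=row.to_bytes(8, "little"),
        ).digest()
        return int.from_bytes(digest, "little") % self.width

    def update(self, item, count=1):
        # Increment one counter in every row.
        for row in range(self.depth):
            self.rows[row][self._index(item, row)] += count

    def query(self, item):
        # Collisions only inflate counters, so the minimum across rows
        # is the tightest estimate and never underestimates.
        return min(
            self.rows[row][self._index(item, row)]
            for row in range(self.depth)
        )


sketch = CountMinSketch(width=64, depth=3)
stream = ["a", "b", "a", "c", "a", "b"]
for token in stream:
    sketch.update(token)
estimate_a = sketch.query("a")  # at least 3, the true count of "a"
```

The three metrics named above map onto this structure directly: memory is `width * depth` counters, speed is `depth` hash computations per operation, and accuracy improves as `width` grows because fewer items share a counter.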
A Sketch-based Clustering Algorithm for Uncertain Data Streams
Because of inaccuracy and noise, uncertainty is inherent in time series streams and increases the complexity of stream clustering. Because the data arrive continuously and in massive volume, efficient data storage is crucial for clustering uncertain data streams. Using a hash-compressed structure, an extended uncertain sketch and an update strategy are proposed to store uncertain data streams. And ba...
A Two-stage Equilibrium Travel Demand Model for Sketch Planning
This paper describes a two-stage equilibrium travel demand model. The unique feature of this model is that it takes time-of-day traffic counts instead of land use and demographic data as inputs to derive spatial and temporal travel demand patterns. The first stage of the model is a traffic count-based trip matrix estimator; the second stage is an elastic-demand network flow estimator, w...
Sequential Pattern Mining for Uncertain Data Streams using Sequential Sketch
Uncertainty is inherent in data streams and presents new challenges to data stream mining. Because data streams arrive continuously and are large, modeling sequences of uncertain time series data streams requires significantly more space. Therefore, it is important to construct a compressed representation for storing uncertain time series data. Based on granules, sequential sketches are created t...
Journal
Journal title: IEEE Transactions on Parallel and Distributed Systems
Year: 2020
ISSN: 1045-9219,1558-2183,2161-9883
DOI: 10.1109/tpds.2020.2987609